10 30 50 

TTCGGGCACGAGGGCAGGATGGCGCCACCACCAGCTAGAGTACATCTAGGTGCGTTCCTG 

MA PPPARVHLGAFt 
70 90 110 

GCAGTGACTCCGAATCCCGGGAGCGCAGCGAGTGGGACAGAGGCAGCCGCGGC.CACACCC 
AVTP NPGSA ASGTEAAAATP 

130 150 170 

AGCAAAGTGTGGGGCTCTTCCGCGGGGAGGATTGAACCACGAGGCGGGGGCCGAGGAGCG 
SKVWGSSAG RIE PRGGGRGA 

190 .210 230 

CTCCCTACCTCCATGGGACAGCACGGACCCAGTGCCCGGGCCCGGGCAGGGCGCGCCCCA 
L P I S M G Q H G P SARA R A G R A P 

250 270 290 

GGACCCAGGCCGGCGCGGGAAGCCAGCCCTCGGCTCCGGGTCCACAAGACCTTCAAGTTT 
G P R P AR E A S P R L R V H KT FK F 

310 330 350 

GTCGTCGTCGGGGTCCTGCTGCAGGTCGTACCTAGCTCAGCTGCAACCATCAAACTTCAT 
VVVGVLLQVVPSSAATIKLH 

370 390 410 

GATCMTCMTTGGCACACAGCAATGGGAACATAGCCCTTTGGGAGAGTTGTGTCCACCA 
DQS IGTQQWEHSP LGELCPP 

430 450 470 

GGATCTCATAGATCAGAACGTCCTGGAGCCTGTAACCGGTGCACAGAGGGTGTGGGTTAC 
GS_H R S E RP GAC NRCTEGVGY 

490 510 530 

ACCMTGCTTCCMCMTTTGTTTGCTTGCCTCCCATGTACAGCTTGTAAATCAGATGAA 
TNAS N N L FAC LP CTAC KSD E 

550 570 590 

GAAGAGAGAAGTCCCTGCACCACGACCAGGAACACAGCATGTCAGTGCAAACCAGGAACT 
EERSPCTTTRNTACQCKPGT 

610 630 650 

TTCCGGAATGACAATTCTGCTGAGATGTGCCGGAAGTGCAGCACAGGGTGCCCCAGAGGG 
FRNDNSAEMCRKCSTGCPR G 

670 690 710 

ATGGTCAAGGTCAAGGATTGTACGCCCTGGAGTGACATCGA.GTGTGTCCACAAAGAATCA 
MVKVKOCTPWSOIECVHKES 

FIG.1A 



730 750 . 770 

GGCMTGGACATMTATATGGGTGATTTTGGTTGTGACTTTGGTTGTTCCGTTGCTGTTG 
GNG HN IWV I LVVT L VV PLLL 

A A A, A A AAAAAAA AAAA AA A A AAA A A A A A A A A A A AAAAA A A A A A** 

790 • 810 830 

GTGGCTGTGCTGATTGTCTGTTGTTGCATCGGCTCAGGTTGTGGAGGGGACCCCAAGTGC 
VAVL IVGCC I GSGC G G D P K C 

■A A- A A A A A A A A A A A A A A A A A A A A A A A A A A A A A A A A A A A - A - A - A 

850 870. 890 

ATGGACAGGGTGTGTTTCTGGCGCTTGGGTCTCCTACGAGGGCCTGGGGCTGAGGACAAT 
M D R V C F W.R L G L L R G P G A E 0 N 

910 930 950 

GCTCACAACGAGATTCTGAGCAACGCAGACTCGCTGTCCACTTTCGTCTCTGAGCAGCAA 
A H N E I L S N A D S L S T F V S E Q Q 

970 990 1010 

ATGGAAAGCCAGGAGCCGGCAGATTTGACAGGTGTCACTGTACAGTCCCCAGGGGAGGCA 
MESQE PAE LT GVTVQSP GEA 

1030 1050 1070 • 

CAGTGTCTGCTGGGACCGGCAGAAGCTGAAGGGTCTCAGAGGAGGAGGCTGCTGGTTCCA 
QC LLG PA EA EGSQRR.RL LVP 

1090 1110 1130 

GCAMTGGTGCTGACCCCACTGAGACTCTGATGCTGTTCTTT(^CMGTTTGCAAACATC 
ANG ADP TETLMLFFDKFANI 

1150 1170 1190 

GTGCCCTTTGACTCCTGGGACCAGCTCATGAGGCAGCTGGACCTCACGAAAAATGAGATC 
V P F0 SWDQLMRQLDL TKNEI 

1210 1230 1250 

GATGTGGTCAGAGCTGGTACAGCAGGCCCAGGGGATGCCTTGTATGCAATGCTGATGAAA 
DVVRAGTAGPGDAL Y A M L M K 

1270 1290. 1310 

TGGGTCAACAAAACTGGACGGAACGCCTCGATCCACACCCTGCTGGATGCCTTGGAGAGG 
WVNKTGRNA S I H T L L DA L E R 

1330 1350 . 1370 

ATGGMGAGAGACATGCAAMGAGMGATTCAGGACCTCnGGTGGACTCTGGAMGTTC 
/HEERHA/CEKIQOLLVQSGKF 
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1390 1410. . 1430 

ATCTACTTAGMGATGGCACAGGCTCTGCCGTGTCCTTGGAGTGAMGACTCTTTTTACC 
IYLEDGTGSAVSLE 

1450 1470 1490 

AGAGGTTTCCTCTTAGGTGTTAGGAGTTAATACATATTAGG 1 1 I I I I I I I I I FTTAACAT 

1510 1530 1550 . 

GTATACAMGTAMTTCTTAGCCACGTGTATTGGCTCCTGCCTGTMTCCCATCACTTTG 

1570 1590 1610 

GGAGGCTGACGCCGGTGGATCCACTTGAGGTCCGAAGTTCCAAGACCAGCCCTGAACCAA 

1630 1650 1670 

CATCGTGGAMTGCCCGTCTTTTACAAAAAAATACCAAAAATTCAACTGGAATGTGCATG 

1690 1710 1730 

GTGTGTGCCATCATTTCCTCGGCTAACTACGGGAGGTCTGAGGCCAGGAGAATCCACTTG 

1750 1770 1790 

AACCCCACGAAGGACAGTGTAGACTGCAGATTGCACCACTGCACTCCCAGCCTGGGAACA 

1810 1830 • ' ! 1850 

CAGAGCMGACTCTGTCTCMGATAAMTAAMTAMCTTGAAAGAATTATTGCCCGACT 
1870 1890 1910 

GAGGCTCACATGCCAAAGGAAAATCTGGTTCTCCCCTGAGCTGGCCTCCGTGTGTTTCCT 
1930^ 1950 . 1970 

TATCATGGTGGTCMTTGGAGGTGTTAATTTGAATGGATTAAGGAACACCTAGAACACTG 
1990 2010 2030 

GTMGGCATTATTTCTGGGACATTATTTCTGGGCATGTCTTCGAGGGTGTTTCCAGAGGG 
2050 2070 2090 

GATTGGCATGCGATCGGGTGGACTGAGTGGAAAAGACCTACCCTTAATTTGGGGGGGCAC 
2110 2130 2150 

C GTCC GAC AGACTGGGGAGCMGATAGMGAAMC AAAAAAAAAAAAAAAAA 

FIG.1C 
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HTOIY07R 



1 GGCANAGGTN CGTACCTAGC TCACCTGCAA CCATCAAACT TNATGATCAA 

51 TCAATTGGCA CACAGCAATG GGAAACATAG CCCTTTGGAA GANTTGTNTG 

101 CACCAGGATC TCATAGATCA AAACATCCTG GGAGCCTGTT AACCGGTGCC 

151 CCAAAGGNTG GTCAAGGTCA AGGAATTGTT NCGCCCTGGA AGTGAACATC 

201 GAGTGTNTCC ACAAAGGATT CAGGCAATGG GACATAAATA TATGGGTGM 

251 TTTTGGTTGT GAACTTTGGT TGNTCCCGTT GNTGTTGNTG GCTGTGCTGA 

301 TTGTTTGTTG TTGCATCGGC TTCAGGTTNT GGAGGGGGAC CCAAGTGCAT 

351 GGACAGGGTG TGTTTCTGGG GTTTGGGTCT CTTAGAGGGC NTGGGTTANG 

401 GCANGTTCAC AAGGGTTTTA GCAANG 



HTXEY80R 



1 


TGGGGCTGAG 


GACAATGCTG 


51 


CTGTCCACTT 


TCGTCTNTGN 


101 


TTTGACAGGT 


GTCACTGTAC 


151 


TGAGTTGGGG 


ACAGGCCCTT 


201 


CATGNCTCGG 


CTTCNNNTGG 


251 


GGACTTNAGG 


GGGACGGNGC 


301 


AAGGGGGACA 


GGCAI M 1 IN 



ACNACGAGAT TCTGAGCAAC GCAGNACTNG 
GCAGCAAATG GAAAGCGAGG AGCCGGCAGA 
AGTCCCCAGG GGAGGCACAG TGTCTGCTGG 
GCAAGACCTT GTGAGGCAGG GGGTGAAGGC 
TCAAAGGGGA AGTGGAGCCT GAGGGAGATG 
TGCGTGGGGA AAAAGCAGCC ACCNTTTGAC 
CAAATGTGTG CTTNTTGGT 
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